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ABSTRACT 

We devise improved photometric parameters for the morphological classification of galaxies using a 
bright sample from the First Data Release of the Sloan Digital Sky Survey. In addition to using an 
elliptical aperture concentration index for classification, we introduce a new texture parameter, coarseness, 
which quantifies deviations from smooth galaxy isophotes. The elliptical aperture concentration index 
produces morphological classifications that are in appreciably better agreement with visual classifications 
than those based on circular apertures. With the addition of the coarseness parameter, the success rate 
of classifying galaxies into early and late types increases to ~ 88% with respect to the reference visual 
classification. A reasonably high success rate (~ 68%) is also attained in classifying galaxies into three 
types, early-type galaxies (E+SO), early- (Sa-|-Sb) and late- (Sc-|-Sdm-|-Im) type spiral galaxies. 



Subject headings: galaxies: fundamental parameters 
1. Introduction 

The morphological classification of galaxies, which 
assigns galaxies discrete classes in the form of a tuning- 
fork diagram (the so called Hubble sequence, Sandagc 
1961), allows us to quantify the basic features of galax- 
ies and to relate them to the galaxies' formation and 
evolution histories. While the Hubble classification is 
based on the visual inspection of images of galaxies, 
and therefore necessarily involves subjective elements, 
it provides a basis for many cxtragalactic studies (e.g.. 
Dressier 1980; Binggcli, Sandagc & Tammann 1988; 
Dressier 1994). 

With the advancement of digitized galaxy surveys, it 



^ Based on the thesis by CY submitted to Nagoya University 
in fulfillment of MA degree requirements. 

Department of Physics and Astrophysics, Nagoya University, 
Chikusa-ku Nagoya 464-8602, Japan: cyamauchOa.phys.nagoya- 
u. ac.jp 

■'Astronomical Data Analysis Center and Division of Opti- 
cal and Infrared Astronomy, National Astronomical Observatory, 
Mitaka, Tokyo 181-8588, Japan 

"^Institute of Astronomy, School of Science, University of 
Tokyo, Mitaka, Tokyo, 181-0015, Japan 

^Institute for Cosmic Ray Research, University of Tokyo, 
Kashiwa, 277-8582, Japan 

^Department of Astronomy, School of Science, University of 
Tokyo, Tokyo 113-0033, Japan 

School of Physics & Astronomy University of Nottingham 
University Park Nottingham NG7 2RD, England 

^Institute of Space and Astronautical Science, Japan 
Aerospace Exploration Agency, 3-1-1 Yoshinodai, Sagamihara, 
Kanagawa 229-8510, Japan 



is highly desirable to develop a fast, automated method 
of morphological classification applicable to large data 
samples, without loosing the accuracy of the traditional 
visual classification. The typical approaches employed 
for morphological classification arc to apply artificial 
neural networks (Burda & Feitzinger 1992; Storrie- 
Lombardi et al. 1992; Serra-Ricart et al. 1993; Naim 
et al. 1995; Odewahn et al. 2002; Ball et al. 2004), and 
characterizations with simple surface photometric pa- 
rameters (Doi, Fukugita & Okamura 1993; Abraham 
et al. 1994). In this paper, we focus on the photomet- 
ric parameters for the morphological classification of 
galaxies using Sloan Digital Sky Survey (SDSS, York 
et al. 2000; Early Data Release, Stoughton et al. 2002, 
hereafter EDR; First Data Release, Abazajian et al. 
2003, hereafter DRl; Second Data Release, Abazajian 
et al. 2004) imaging data. The simplest indicator of- 
ten used in the literature is the parameter that char- 
acterizes the concentration of light towards the center 
of galaxies (Morgan 1958). Doi et al. (1993) defined 
the concentration index^-* using two equivalent radii of 
the elliptic isophotes. They show that early- and late- 
type galaxies are reasonably well separated in isophotal 
photometry, if surface brightness is used as a second 
parameter. Abraham et al. (1996) introduced a rota- 
tional asymmetry parameter A in addition to C that is 
defined using the fiux measured in elliptical apertures. 
The rotational asymmetry parameter allows for cffi- 



Strictly speaking, this is the inverse concentration index. But 
we call it "concentration index" throughout this work. 
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cient discrimination between irregular and spiral galax- 
ies. The log C-log A diagram has been used as a tool to 
classify morphology of distant galaxies observed with 
the HST (e.g., Abraham et al. 1996; Brinchmann et 
al. 1998). Takamiya (1999) discusses the evolution of 
the structure of galaxies using x parameter calculated 
from the residual image, which indicates the power at 
high spatial frequencies in the disk of the galaxy. Shi- 
masaku et al. (2001) and Strateva et al. (2001) showed 
that the concentration index, calculated using circular 
isophotes as part of the standard SDSS pipehne reduc- 
tions (Stoughton et al. 2002), correlates fairly well with 
morphological type and can be used for classification 
into early and late galaxy types. The success rate of 
these approaches, with reference to visual inspection, 
is approximately 80%. 

In this paper, we use the SDSS Data Release 1 (DRl, 
Abazajian et al. 2004) imaging data of 1421 galax- 
ies selected from the SDSS Early Data Release (EDR, 
Sthoughton ct al. 2002) and early commissioning data 
to improve the success rate of morphological classi- 
fiers by introducing additional photometric parameters. 
First, we consider the performance of the standard 
circular-aperture concentration index, with particular 
emphasis on the cases where the classification fails. We 
proceed to employ the elliptical definition of Doi et al. 
(1993) and Abraham et al. (1996) for the measurement 
of isophotal and aperture fluxes and compare the suc- 
cess rate to that obtained using the circular definition. 
We also introduce a new texture parameter to help our 
classification. The motivation for this approach is that, 
in the visual classification, we mainly resort to surface 
brightness texture, such as properties of spiral arms, 
dumpiness, and HII regions, in addition to the concen- 
tration of light towards the galaxy center. The only 
work that we are aware of that discusses texture pa- 
rameters as a classifier is that by Naim, Ratnatunga & 
Griffiths (1997). They proposed "blobbiness" , "isopho- 
tal center displacement" and "skeleton ratio" parame- 
ters, which correspond to roughness, global asymme- 
tries and more localized structure, respectively. These 
parameters correlate with galaxy morphology and may 
be useful for classification, but our own tests show that 
they do not enhance the success rate of the classi- 
fier. Our texture parameter - coarseness - describes 
the structure of the outer galaxy isophotes, emulat- 
ing the method employed in visual classification. The 
flux fluctuations are computed along elliptical circum- 
ferences in order to characterize the galaxy profile's de- 
parture from smoothness. The coarseness parameter is 
defined as the ratio of the range of fluctuations in sur- 
face brightness along the elliptic circumference to the 
full dynamic range of surface brightness of the galaxy. 
In this study we evaluate the performance of the ellip- 
tical concentration index and the coarseness parame- 
ter as classification parameters (used separately or to- 
gether) in comparison to the visually classified SDSS 
sample of Fukugita et al. (in preparation). 



This paper is organized as follows. The elliptical- 
aperture concentration index is defined in section 2. 
The coarseness parameter is introduced in section 3. 
After briefly describing our sample in section 4, the 
correlation of the two parameters with the visually ob- 
tained morphology is investigated in section 5. We 
study the performance of the morphological classifica- 
tion using these parameters in section 6, and conclu- 
sions arc presented in section 7. 

2. CONCENTRATION INDEX 

We consider a concentration index for elliptical 
apertures using Petrosian quantities. The intensity- 
weighted second-order moments are defined in the 
SDSS photometric pipeline (hereafter PHOTO; Lupton 
1996; Lupton ct al. 2001) as 



(x'y), M,y EE {xy/r'), Myy EE {y^y). (1) 



If the major and minor axes of the ellipse lie along the 
X and y axes, the axis ratio a = b/a is calculated as 
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In general, we rotate the image to account for the po- 
sition angle </> from [x' , y') to (x, y) to align the galaxy 
image along the axes: 



(4) 



(5) 
(6) 



X = X cos (p — y sm qj 
y' = X sin (f> + y cos (f). 

The Stokes parameters U and Q are calculated as 

[7/2 = Ma;'y' = (A^a- - Myy ) sin (j) cos <j> 
1 — a sin 20 
^ l+a 2~ 

1 - a 

Q EE M^r'x' - My>y' = "^^^ 2'?^' 

where (p — 1/2 ta.n^^ (U/Q). The axis ratio is a = 
(1 + P)/{1 - P) with P = ^(JT^TQ^). u and Q are 
evaluated in PHOTO and cataloged in the SDSS data 
releases. 

To calculate the concentration index Ce for the el- 
lipse, we consider the area Af.{a) of the ellipse of the 
semi-major axis a and axis ratio a, and the integrated 
flux Fe (a) within A,, (a) . We deffiie the Petrosian semi- 
major axis ap for a given rj as 



{_Fe(1.25ap) - Fe(0.8ap)}/{Ae(1.25ap) - Ae(0.8ap)} 
Fe(ap)Me(ap) ' 



(7) 



where we take ?7=0.2, and the elliptical Petrosian flux 
i^p as 

Fp = Fe(fcap) (8) 
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with k set equal to 2, following the SDSS definition 
(Strauss et al. 2002). 

The Petrosian half-light and 90%-light semi-major 
axes 050 , ago are defined in such a way that the flux in 
the elliptical apertures of these semi-major axes is 50% 
and 90% of the elliptical Petrosian flux: 

Fe(a5o) = 0.5Fp, Fe(a9o) = 0.9^^p . (9) 
We define our concentration index Ce by 

Ce = aso/ago ■ (10) 
3. THE COARSENESS PARAMETER 
3.1. Definition of a Texture Parameter 

In this section we provide a detailed "step-by-step" 
method for the calculation of the coarseness param- 
eter. The position angle (f> and the axis ratio a are 
calculated in the same way as in the previous section. 
Each position (x, y) of pixel on the image is calculated 
by equation (4) beforehand, so that the semimajor and 
semiminor axes are aligned with the x and y axis. 

We consider successive elliptical annuli from a ~ aio 
outwards to ago, where a/ is defined by i^e(a/) = j^Fp 




10 15 20 25 30 
semi-major axis(pixels) 

Fig. 1. — An illustration explaining the method of com- 
puting Y for the galaxy image inlaid in the top right of 
the figure. The figure shows division of the area sand- 
wiched by the two ellipses specified by ago and aio into 
Timax annuli. The dotted lines show division into ellipti- 
cal annuli. The range of flux /range ('T') is defined by the 
90%- and 10%-tiles of the flux distribution of n-th an- 
nulus as an inlaid histogram in the top left of the figure. 
The /range computcd in each annulus is averaged over 
N annuli to give (/range), and the coarseness parameter 
Y is given as (/range) divided by the full dynamic range 
of the flux observed for the galaxy, A/. 



as in equation (9), assuming that each elliptical annulus 
has the same position angle and is congruent. We define 
the 'equivalent distance' d from the center of the galaxy 
to a pixel at (x, y) by 



Va;2 + {y/ay 



(11) 



We then divide the area sandwiched by the two ellipses 
specified by aio a-nd ago into rimax annuli, as specified in 
what follows: We calculate d of each pixel contained in 
the annulus between aio and ago, and place the pixels 
in ascending order with respect to d as 



^2, , do 



1st annulus 



2nd annulus 



3rd annulus 



(12) 

where the number of pixels contained in the n-th annu- 
lus Af„ is calculated by the equivalent distance to the 
innermost point in the n-th annulus, dm '■ 



(13) 



where [a;] is the integer part of x. The last annulus, 
which terminates at ago , does not generally satisfy the 
condition (13). We show the actual algorithm which 
satisfies equation (12) and (13) as follows: 

1) Focus on 1st d, di^. 

2) Calculate A/i by equation (13), A/i = [2nadi^]. 

3) When N\=& for an example^^ the member of 1st 
annulus is 

dii , , dig , di^ , (ii5 and dig . 

4) (i2i is automatically determined by dig (d2r is next 
to dig. That is, 7th d). 

5) Focus on 7th d, d2r . 

6) Calculate M2 by equation (13), M2 = [27rad2i] ' ' ' 
Let us take the n-th annulus, and consider the flux 

distribution. We denote the 90%- and 10%-tilcs of 
the flux distribution in the n-th annulus as /go(n) and 
/io(n), respectively (sec the inlaid histogram in the top 
left of Figure 1), and the range of the two values as 



/range (") = /9o(") " Iw{n) 



(14) 



We define the mean of /range {n) over the annuli between 



and fir. 



(/range) 



I -max 

(^^bcgin 1) 



/range("-), (15) 



where the mean is taken from the nbegin-th to the 
outermost annulus, some inner annuli being excluded. 
The exclusion of the inner radius is necessary in order 
to enhance the visibility of texture, which is usually 



If equidistant pixels are present, e.g., dig is equal to dig^^ for 
some integer x, Afi is set to 6 + x. 
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more pronounced in outer regions for late-type galax- 
ies. From trial and error, we adopted Jibogin = ['T-max/3] 
with the floor function as the best value for the perfor- 
mance of classification. 

To further enhance the signal, we subtract the con- 
tribution of the sky noise from (/range), 

/signal = \J (/rangc)^ - (2.56 • ^sky)^ , (16) 

where ^sVy denotes the rms of the sky noise. We multi- 
ply the sky noise by a factor of 2.56 so that its strength 
corresponds to the range between 90 and 10%-tiles in 
the Gaussian distribution to match our definition of 
/range- With this choicc, /signal vauishcs when the frame 
does not contain objects, i.e., contains sky noise only. 

We then divide /signal by the dynamic range of the 
image A/, i.e., 

AI ' ^ ' 

The AI is defined by 

AI = max{/9o(n)} - min{/io(n)} , (18) 

where max{} and min{} are taken from 1 < n < rimax- 
We use all annuli in the computation of A/, not just 
J^begin ^ n < riniax, which allows us to increase the 
signal contrast. 

This procedure is sketched in Figure 1 , which shows 
the flux of each pixel of a test image plotted as a func- 
tion of its semi-major axis. Note that, in the absence 
of noise, the Y parameter vanishes if the profile is a 
smooth function (such as a model dc Vaucouleurs or 
a model exponential profile), since this corresponds to 
/range = 0. On the othcr hand, if structures such as 
spiral arms are present, both /range and Y become non- 
zero. 

3.2. Image Rescaling 

The coarseness parameter thus defined may depend 
on the apparent size of galaxies, because larger images 
are resolved in finer details. To avoid this dependence, 
we set the reference size of the galaxy image, and re- 
duce the size of larger galaxies to the reference size. 
The coarseness parameter is measured for the rcscaled 
image. The number of pixels in the rcscaled image is 
taken to be that in the reference image. The reduction 
of image size takes into account both the fact that the 
rescaling factor is generally non-integer and the image 
variations in seeing, as explained below and illustrated 
in Figure 2. We exclude from further consideration 
galaxies with apparent sizes smaller than our chosen 
reference size. 

We rescale the image with the semimajor axis ago to 
the reference size image with agg^ using Gaussian func- 
tions. We assign a Gaussian function to each pixel and 
map it to the rcscaled array. The Gaussian function 



for the pixel at {x, y) is constructed so that its integral 
equals I{x,y)p, with the rescaling factor, /, given by: 

/ = ^. (19) 
ago 

The Gaussian width a^y is 



<7^y = \l Cr^^f - (o-seeing/)^ ■ (20) 

where Cref is the seeing taken as our reference and 
Cseeing is thc actual seeing of each image. With this 
procedure the PSF in the rcscaled image is standard- 
ized by varying Uxy from image to image. The Gaussian 
functions with a^y are projected onto a new array with 
the interval of pixels 

w — f X pixel size . (21) 

This projection conserves surface brightness. 



Original Image 




/= 1/2 Rescaled Image 



Fig. 2. — The method of creating rescaled images 
(/ = 1/2). Each Gaussian function whose intensity 
is I(x, y) ■ p and width is adjusted to the effective see- 
ing will be mapped to the rescaled array. The mapping 
<l,l>-|-<2,l>-|-...+<4,4> which creates the rescaled 
image conserves the surface brightness. 
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Fig. 3. — Effective ratio of sky noise fs for the rescaled 
image as a function of the scahng factor / for a given 
seeing a^y 

The use of rescaled images requires similar adjust- 
ment of the observed sky noise 



^sky fs ' 



(22) 



where fs is the effective ratio of standard deviation 
obtained from an artificial sky-noise image to which 
equivalent Gaussian smearing has been applied. The 
quantity fs, shown in Figure 3, depends on both a^y 
and the pixel size. 

4. THE SAMPLE 

The galaxies used in our test arc taken from the 
northern equatorial stripe given in SDSS DRl (Abaza- 
jian et al. 2003). The photometric system, imaging 
hardware and astromctric calibration of SDSS arc de- 
scribed in detail elsewhere (Fukugita et al. 1996; Gunn 
et al. 1998; Hogg et al. 2001; Smith et al. 2002; Pier 
et al. 2003). We use the catalog of visual morpho- 
logical classification provided by Fukugita et al. (in 



Total Sample 



Our Sample 



Hubble Type 


T 


Number 


Percent 


Number 


Ratio 


unclassified 


-1 


23 


1.3% 






E 





242 


13.3% 


194 


13.7% 


SO 


0.5, 1 


494 


27.2% 


343 


24.1% 


Sa 


1.5, 2 


309 


17.0% 


211 


14.8% 


Sb 


2.5, 3 


301 


16.6% 


250 


17.6% 


Sc 


3.5, 4 


337 


18.5% 


314 


22.1% 


Sdm 


4.5, 5 


80 


4.4% 


80 


5.6% 


Im 


5.5, 6 


31 


1.7% 


29 


2.0% 


Total 




1817 


100% 


1421 


100% 



Table 1: Morphological compositions of the total sam- 
ple and the sample after applying the size cutoff and 
dropping T——1 class. 



preparation; see also Nakamura et al. 2003) based on 
the g-band image. This catalog is based on the SDSS- 
EDR and early commissioning data, and contains 1875 
galaxies brighter than r*=15.9, where r* is the extinc- 
tion corrected Petrosian magnitude '^^ . Using positional 
matching we identify galaxies in the Fukugita et al. 
catalog with DRl objects. This allows us to weed out 
"fake" galaxies present in the catalog as a result of 
erroneous galaxy deblends in the EDR and early com- 
missioning data. This positional matching reduces the 
number of galaxies in the Fukugita et al. catalog by 
58, and then the number of the total sample is reduced 
to 1817 galaxies. The galaxies are classified into T=— 1 
as unclassified and 13 morphological types from T=0 
(corresponding to E in the Hubble type) to 6 (Im) al- 
lowing for half integer classes. In this paper, we mostly 
refer to T~0 as E, 0.5 and 1 as SO, 1.5 and 2 as Sa, 
5.5 and 6 as Im, as designated in Table 1, where the 
number of galaxies in each class is shown. 

The r-band image is used to compute the concentra- 
tion index. We use the original image without rescal- 
ing, since the seeing effects on the concentration index 
are small for our bright galaxy sample. The coarse- 
ness parameter is computed using the g-hand image 



The notation r* denotes the preUminary nature of the early 
photometric calibration of SDSS commissioning data used for 
the original sample selection (see Shimasaku et al. 2001). 
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Fig. 4. — Distribution of the semi-major axes con- 
taining 90% of the Petrosian flux of the 1817 galax- 
ies in the SDSS g-band (1 pixel = 0.4 arcsec). The 
shaded region indicates galaxies with small angular 
sizes (ago < 25pixels) and unclassified galaxies which 
were removed from consideration (396 in total). 
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because this filter is generally thought to be more sen- 
sitive to texture than r-band data (Our experiments 
have shown, however, that the use of the r-band image 
leads to very little difference in classification). We set 
the reference size of the semimajor axis to Gq^q = 25 
pixels (=10"; Gunn et al. 1998) and exclude all galax- 
ies with smaller semimajor axes from consideration. 
This selection for the reference size of the semimajor 
axis was found to be appropriate for the computation 
of the coarseness parameter, as galaxies of larger sizes 
rescaled to 25 pixels retain enough detail for effective 
classification. In addition, we exclude 23 morphologi- 
cally disturbed galaxies of class T=— 1 (unclassified by 
Fukugita et al.) from further analysis. Figure 4 shows 
the galaxy size distribution measured in the g-band, 
and the shaded areas represent the objects eliminated 
by the size cutoff and T=— 1 class. This selection leaves 
the 1421 galaxies in our sample, primarily removing 
galaxies of earlier types. Galaxies of later types (in- 
cluding irregular galaxies) generally have larger size in 
a magnitude-limited sample, and are little affected by 
the size cutoff (see Table 1). The effect of this size cut- 
off on the r-band limited sample is displayed in Figure 
5. This selection certainly causes a bias in the ratio 
among the number of morphological types, but we are 
not concerned in this paper with issues of completeness 
and statistics of the number distribution of morphology. 

We fix FWHM of the seeing at 3.53 pixels (1.40"; 
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Fig. 5. — Distribution of the r-band Petrosian magni- 
tude of the 1817 galaxies. The shaded region indicates 
galaxies with small angular sizes (ago < 25pixels) and 
unclassified galaxies which were removed from consid- 
eration (396 in total). 



the median of our sample) and the observed sky noise 
at 4.0 counts for rescaling to simplify our analysis. 
Since our sample contains bright galaxies whose ago is 
near Ogo^ and variations of the seeing and <r are suffi- 
ciently small, this simplification does not seriously af- 
fect the results. In order to assure the independence of 
the coarseness parameter on galaxy size, we rescale all 
images to ago^=25. Extending the analysis to smaller 
and fainter galaxies will require proper account of the 
image-to-image variations in seeing and sky noise. 

5. PHOTOMETRIC PARAMETERS AND 
THE MORPHOLOGICAL TYPE 

Here we investigate the behavior of the three pho- 
tometric parameters (circular and elliptical concentra- 
tion index and coarseness) as a function of the visual 
morphological type, as quantified by the morphological 
index T. We also show the effect of the axis ratio on 
these parameters. 

5.1. Concentration parameters 

We show that smaller axis ratio affects the standard 
concentration index C = r^o / rgo defined with the Pet- 
rosian flux in the circular apertures, and using the el- 
liptical ones can remove the effect significantly. 

In Figure 6 we plot the two concentration indices 
against visual morphological type index T, calculated 
by (a) conventional circular apertures (C), and (b) el- 
liptical apertures (Ce). Galaxies that follow de Vau- 
couleurs' law give C=Ce=0.29 and those with the ex- 
ponential profile give 0.44. 

The Spearman correlation coefficient with the use 
of C, r=0.781 increases to r=0.820 with Ce- Several 
significant improvements are obtained by the use of an 
elliptical aperture (Ce) compared with the original cir- 
cular aperture (C). From Figure 6, the number of spi- 
ral galaxies visually classified as T—3 and T=3.5 under 
the dotted line is smaller elliptical than circular concen- 
tration index estimates. This is expected for inclined 
later galaxy types since according to our definition less 
concentrated galaxies have larger concentration indices. 
The sample which exhibits larger difference between 
panel (a) and (b) in Figure 6 contains a significant 
number of edge-on spirals, which are misclassified us- 
ing the circular aperture, but correctly identified as less 
concentrated using the elliptical aperture estimates. A 
similar improvement is also seen for Sa galaxies (T=2). 

Thus, using elliptical aperture concentration indices 
for morphological classification increases the classifica- 
tion accuracy by making it independent of galaxy in- 
clination. In Figure 7 we present the concentration in- 
dex versus the axis ratio (smaller axis ratios are indica- 
tive of larger inclinations) for both the circular (panel 
(a)) and elliptical aperture (panel (b)) concentration in- 
dices. The regression lines are drawn for morphological 
classes of galaxies Sb, Sc and Sdm. The lines obtained 
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0.35 



0.3 



0.6 



0.55 



0.5 



0.45 



0.4 



0.35 



0.3 
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(a) r= 0.781 



■■r r 



0.25 - 



0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 
T (our classification) 



(b) r = 0.820 



0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 
T (our classification) 



Fig. 6. — The concentration indices of the 1421 galax- 
ies in our sample versus the visual morphology index 
T. In panel (a) we use the standard circular aperture 
definitions fi'om the SDSS DRl catalog for computing 
the concentration index; in panel (b) we use elliptical 
apertures. Some points were shifted to larger T values 
to avoid overlaps. The linear correlation coefficient is 
given in the top left corner of each panel. The dotted 
line is an arbitrary line placed in the same position in 
both panels to facilitate comparison. 



by regression on samples of different galaxy types in 
panel (a) , which are based on C, are significantly tilted 
with respect to the axis ratio. The concentration in- 
dices of highly- inclined late- type galaxies estimated us- 
ing circular apertures are artificially reduced to values 
characteristic of elliptical morphologies. That is, edge- 
on spirals give an anomalously high concentration of 
light if defined with circular apertures; the morpholog- 
ical classifications that use only C have significant con- 
taminants. Figure 7(b), using Cg, shows that elliptical 
apertures remedy this problem. We observe little tilt of 
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0.55 - 



0.45 - 



(a) 




0.35 - 



0.25^ ^ ° ^ / ^ 
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axis ratio 
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0.55 - 
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(b) 



7 ' ^ ' 



7 ^ 



Sc f + 



0.45 - 



0\ 



f 



0.4 - 



0.35 - 



0.3 - 



Sb 




0.25 [ 
0. 



1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 
axis ratio 



+ 1 



Fig. 7. — Concentration index versus axis ratio using 
circular (a) and elliptical (b) aperture definitions. Each 
symbol indicates visual classification T as specified in 
the legend, and the straight lines arc the linear regres- 
sion lines for types Sb, Sc and Sdm. 
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the regression lines, suggesting that Ce works better to 
classify morphologies than C; the effect of inclination 
is removed with the use of elliptical apertures. 

5.2. Coarseness parameter 

The coarseness parameter as defined in equation 
(17), is equal to the ratio of the range of fluctuations 
in surface brightness (along an elliptical circumference) 
to the full dynamic range of surface brightness. Larger 
values of Y are indicative of the presence of structure 
in the galaxy disks (e.g., spiral arms) and are associ- 
ated with galaxies visually classified as late types. Fig- 
ures 8 and 9 each display 16 galaxies with the smallest 
(y<0.00805) and the largest (y>0.260) Y parameters 
present in our sample. It is apparent that the galaxies 
shown in Figure 8 possess very weak texture and are 
classified as early types. All galaxies shown in Figure 
9 have conspicuous texture and are indeed classified as 
late (Sbc or later)-type spirals. 

It is obvious that texture of galaxies contributes to 
the numerator of equation (17). We should, however, 
emphasize the role of the denominator. When galaxies 
have conspicuous bulges, their A/ values suppress the 
Y parameter. On the other hand, the faint bulge leads 
to a small A/ value that enhances Y. A typical example 



Y=0.00600 y=0.00626 F=0.00667 Y=0.00695 
(SO;r=l) (E;r=0) (S0;T=0.5) (S0;r=0.5) 




y=0.00737 
(S0;r=0.5) 



B 

y=0.00750 
(S0;r=0.5) 



y=0. 00740 y=0.00743 y=0. 00749 
(E;r=0) (E;r=0) (S0;r=0.5) 






y=0.00753 y=0.00760 y=0.00788 
(S0;r=0.5) (S0;T=0.5) (E;T=0) 



of the importance of the denominator of equation (17) 
is a type of galaxy known Magellanic irregular. 
The texture of Magellanic irregulars is not conspicuous, 
but the overall intensity contrast is also small, resulting 
in a small denominator in equation (17) and large Y 
parameter indicative of a very late type. 

Figure 10 displays the coarseness parameter Y plot- 
ted against T. The correlation (r=0.773) is not very 
impressive compared to that for Ce-T, since the Y-T 
correlation is curved away from the linear relation. The 
important feature in this figure is a very narrow distri- 
bution of the Y parameter for 0<T<1 galaxies. The 
distribution is confined to the range 0<y<0.03, which 
is but 10% of the full variation of Y . 

The relation between the Y and axis ratio is shown 
in Figure 1 1 . The linear regression lines for different T 
arc almost flat. They do not appear as flat as those for 
Ce, but the slopes of the lines arc not caused by any 
systematic effects, i.e., Sdm is almost flat and Sc/Sb 
have opposite slopes. We conclude that Y is not af- 
fected severely by the inclination of galaxies. 





y=0.321 
(Sc;r=3.5) 




y=0.290 
(Sdm;T=4.5) 



y=0.378 y=0.336 

(Sc;r=4) (Sc;r=4) 




y=0.304 
(Sc;r=3.5) 



y=0.275 

(Sc;T=4) 




y=0.303 

(Sc;T=4) 



y=0.273 
(Sc;T=4) 




y=0.330 y=0.321 
(Im;r=6) (Sdm;r=5) 



y=0.293 
(Sc;r=4) 



y=0.269 
(Sc;r=4) 




y=0.00795 
(E;r=0) 



y=0.00798 
(S0;T=0.5) 




y=0.00799 

(E;T=0) 




y=0.00805 
(S0;r=0.5) 




y=0.268 
(Sc;r=3.5) 




y=0.266 
(Sc;r=3.5) 




y=0.264 
(Sc;r=3.5) 



y=0.260 
(Im;r=6) 



Fig. 8. — Rescaled g-band images of the 16 galaxies 
with the lowest coarseness parameters in our sample 
(y < 0.00805). The visually classified morphological 
types are shown in brackets together with the quanti- 
tative morphological index T . 



Fig. 9. — Rescaled g-band images of the 16 galaxies 
with the largest coarseness parameters in our sample 
(0.260 < Y). The visually classified morphological 
types are shown in brackets together with the quan- 
titative morphological index T . 
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Fig. 10. — The coarseness parameter Y of the 1421 
galaxies in our sample versus the visual morphology 
index T. Some points were shifted to larger T values 
to avoid overlaps. The linear correlation coefficient is 
given in the top left corner of the panel. 



6. MORPHOLOGICAL CLASSIFICATIONS 
USING C(e) and Y INDICES 

6.1. Early versus late types 

We first attempt to classify galaxies into two types, 
early (E-SO/a) and late (Sa-lm), by setting a dividing 
value for each of three parameters displayed in Figure 
6(a,b) and Figure 10. We call the two classes of the 
sample 'e' and We evaluate the completeness P 
and the contamination Q of the 'e' and as defined 

by 



P. = 



N, 



e{E+SO} 



N. 



{E+SO} 



Qe = 



N, 



e{Sa+Sb+Sc+Sdm+Im} 



-^f{Sa+Sb+Sc+Sdm+Im} 



N, 



e{E+so} 
1^,. 



, (23) 



, (24) 



*{Sa+Sb+Sc+Sdm+Im} 

where is the number of all galaxies chosen by the 
separator line, ./Vg^E+so} is the number of E+SO galax- 
ies chosen by the separator line, and A''{e_|_so-i, is the 
total number of E+SO galaxies. Other notations are 
defined similarly. 

Figure 12 shows completeness and contamination for 
the classification using the C parameter. The same 
analysis is performed in Shimasaku et al. (2001) which 
attains 80% completeness at Ci=0.35. Strateva et al. 
(2001) also reports 83% completeness at Ci=0.38 us- 
ing a concentration index with circular apertures. We 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



axis ratio 

Fig. 11. — The coarseness parameter Y plotted against 
the axis ratio. Each symbol indicates visual morpho- 
logical type, and straight lines are the linear regression 
for Sb, Sc and Sdm galaxies. 




Fig. 12. — Completeness P and contamination Q for 
the early- and late-type galaxy samples as a function 
of the division parameter Ci with the standard circular 
aperture definition. The thick solid and dotted lines 
show completeness for early- and late-types. The thin 
solid and dotted lines show contamination for early- 
and late-types. 
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find that Pe=Pi=85.0%, Qe=22.5% and Qi=9.5% with 
the use of the division constant Ci=0.339. The success 
rate is somewhat higher in our case, however, with es- 
sentiahy the same division parameter. Improved per- 
formance is seen in Figure 13 where the C param- 
eter is replaced with Ce- we obtain Pe=Pe=86.7%, 




0.25 0.3 0.35 0.4 0.45 0.5 0.55 



Cel 

Fig. 13. — Completeness P and contamination Q of 
the early- and late-type galaxy samples as a function 
of the division parameter Cei, defined using elliptical 
apertures. See Figure 12 for the line definitions. 




Fig. 14. — Completeness P and contamination Q of 
the early- and late-type galaxy samples as a function 
of the division parameter Yi. See Figure 12 for the line 
definitions. 



Qe=20.0% and ^^=8.2% for Cei =0.349. One may at- 
tain Qe=Qi=l5.0% with the choice of Cei=G.332 if the 
completeness of E-I-SO galaxies is sacrificed. 

The result of classification using the Y parameter 
is presented in Figure 14. This result shows a higher 




0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 
Ce 



(b) 




0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 
Ce 



Fig. 15.— (a) 1421 Galaxies plotted on the (Ce,F) 
plane and (b) the average 2-vector and standard de- 
viation for each morphological class. The solid line 
in panel (b) is the quadratic function least-squares 
fit to the points representing the average values for 
each morphological class. The two dot-dashed lines 
are the best classifiers into three types, E-f SO, Sa-|-Sb 
and Sc-|-Sdm-|-Im using the 2-dimensional classifica- 
tion. The dotted and dashed lines show the best one 
dimensional separators using Ce and respectively. 
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success rate than the Ce classification, Pe=Pe=^88.1%, 
Qe=18.1% and Qi=7.5% for ^=0.030. We can at- 
tain minimum contamination of Qe=Qe=^2.1% for 
Yi =0.024 with a modest cost of Pe- We conclude that 
the Y parameter is superior to the concentration in- 
dices as the morphology classifier. 

We now try to obtain the maximum performance 
using two parameters, Ce and Y, by optimising the 
choice of the dividing parameters (see Figure 15). We 
consider the position of the center of the average 2- 
vector for each morphological class, 

r — ( (^ge) \ r — ( (^eso) \ 
- \ (Ve) }-^-">-\ {Ys,) ) • 

'''' 

where (CeE) = EieE etc. With the weight 

of the number of the galaxies having the relevant mor- 
phological class given to these points, we fit 7 points by 
a quadratic function. For our 1421 galaxies, we obtain 

/g(Ce) = 2.702Cg2 _ i.4i9c^ + 0.1967. (26) 

In general, for 2-D distributions that show separate 
linear correlations for each type, the separator lines are 
parallel to the individual correlations (see Doi et al. 
1993). Wc apply this method to the quadratic regres- 
sion line. Wc then consider a set of lines crossing this 
quadratic curve at (Cei , /g(Cei)), 




0.25 0.3 0.35 0.4 0.45 0.5 0.55 



Cel 

Fig. 16. — Completeness and contamination of the 
early and late-type galaxy samples as a function of Cei 
specified at the crossing point with the quadratic func- 
tion, using the Ce-Y diagram (Figure 15). See Figure 
12 for the line definitions. 



where is a constant, and is the first derivative 
of fq. We take fi{Ce) as the dividing line for classi- 
fication. We adopt 7^=0.1, which turns out to give 
the best performance. Figure 15 shows the average 
2-vector and standard deviation for each morphologi- 
cal class, the quadratic regression fq{Ce) and the best 
separator line between the early and late type galaxies 
(lower dot-dashed line). 

The completeness and contamination curves for clas- 
sification using the Ce-Y diagram are presented in Fig- 
ure 16. The maximum success rate we achieved is 
(assuming equal completeness for both early and late 
types) Pe^Pe=89A%, with Qe=16.4% and Qf=6.7% 
for Cel =0.348, or (assuming equal contamination) 
Fe=79.3% and P£=93.6% with Q^=Q^=11.8% for 
Cel =0.337. These are remarkably high success rates 
given the fact that visual classification suffers from 
uncertainties, perhaps of the order AT ~ 1. 

6.2. Classification into three types 

We consider separation into three types, dividing 
late ('£')- type galaxies into early (~ Sa-|-Sb) and late 
spirals (~ Sc-|-Sdm-|-Im). We call the two classes of 
the sample £i and £u. We fix the first division for the 
early/late classification as determined in the previous 
subsection. We set the second division which separates 
£i and in so that the completeness of Sa-|-Sb in £i and 
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Table 2: Classification of 1421 galaxies into three types 
(E+SO : Sa+Sb : Sc+Sdm+Im) using parameter C,Ce, 
or Y. The completeness of a selection method can 
be calculated using the numbers from each column as 
follows. For the selection method using C, the com- 
pleteness of the E+SO sample is 458/537~0.85 (from 
column 1 of the C table), and the contamination is 
(124+9)/591~0.23 (from row 1 of the C table). 
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that of Sc+Sdm+Im in in are nearly equal. 

We show the results in matrices of 3 x 3 (and an addi- 
tional row and column to show the subtotals) in Table 
2 for the three indicators using C, and Y. We take 
the second division separators which divide late-type 
galaxies into early- and late-spirals to be C2 =0.436, 
Ce2=0.463, and F2=0.106, respectively, while the sep- 
arators between early- and late-type galaxies are the 
same as those quoted in the beginning of Section 6.1. 
The completeness can be read from the column by di- 
viding the number in the diagonal entry by the to- 
tal number of listed in the bottom of the correspond- 
ing column. For example, the completeness of Sa-|-Sb 
galaxies separated by C1.2 is 278/461 = 60.3%. This 
compares to 285/461 = 61.8% with the use of Cg, and 
305/461 = 66.2% with Y. The contamination is read 
from the row. The contamination in the early spiral 
galaxy sample, £1, from E+SO galaxies and late-type 
spiral galaxies, for instance, is (76 -I- 161)/515 = 46.0% 
with the use of C, (68 -I- 156)/509 = 44.0% with Ce, 
and (59 + 138)/502 = 39.2% with Y. Similarly the £u 
contamination is 20.0%, 20.2%, and 17.9%, for the C, 
Ce, and Y parameter classification, respectively. 

The classification with Y again produces the best 
result. We also note that the gain attained with the use 
of elliptical apertures over circular apertures is small for 
£1 and £11, although the Cg parameter gives generally 
better performance, if slight. One might question why 
the gain with Ce over C is rather small in contrast to 
the emphasis given in the previous section that using 
Ce parameter removed the effect of inclination. The 
reason is that early and late spiral galaxies are not well 
separated in the concentration parameter space, and 
show large dispersion with heavy overlaps. So the effect 
of inclination does not play a crucial role. In fact, visual 
separation into early and late spirals relics more on the 
opening of spiral arms and texture. 

It is important to note that contaminants from late 
spirals to the early-type sample, or vice versa, are very 
small, less than < 1%, at least with the Ce and Y 
indices. Most of the contaminants in the E-I-SO sample 
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Table 3: Classification of 1421 galaxies into three types 
(E+SO/Sa-hSb/Sc-l-Sdm-Hm) using the Ce-Y diagram. 
The completeness of a selection method can be cal- 
culated using the numbers from each column as fol- 
lows. The completeness of E-I-SO sample in the e is 
480/537~0.89 (from column 1), and the contamination 
of the e is (92-H2)/574~0.16 (from row 1). 



are from SOa and Sa galaxies. 

Finally, we examine how the performance of our 
morphology classifier improves by considering 2-dimension 
classification in the Ce-Y space. The result is shown in 
Table 3. The completeness of the Sa-|-Sb galaxy sample 
is 314/461 = 68.1%, a 2% increase, compared with the 
value found employing Y alone. The contamination in 
the £1 sample from E+SO galaxies and late-type spiral 
galaxies decreases to (53 + 137)/504 = 34.1%, which is 
5% smaller than the value obtained by using Y alone. 
The contamination in the early-type spiral sample still 
primarily arises from late-type spiral galaxies, rather 
than E-I-SO galaxies. The contamination in the £11 
sample decreases from 17.9% to 17.2%. 

7. CONCLUSIONS 

We started by examining the standard concentration 
index C = r^o/rgg, defined using the Petrosian flux in 
circular apertures, and found that the correlation is sig- 
nificantly affected by galaxy inclination. The value of 
the standard concentration index C of a highly inclined 
spiral is artificially reduced (i.e. the galaxy appears to 
have more centrally concentrated light) and the galaxy 
is misclassified as an early type. We found that this 
inclination dependence vanishes if we define the con- 
centration index using elliptical apertures. The ellipse- 
based concentration index Ce calculated from the Pet- 
rosian flux gives an inclination-independent indicator. 

In addition, we devised a new texture parameter, 
Y, that represents the coarseness of surface brightness. 
The Y parameter measures the texture of a galaxy disk 
in relation to the galaxy's overall surface brightness 
contrast (including the galaxy bulge). It is defined in a 
manner that closely mimics visual classification. Late 
type spiral galaxies (which often show distinct spiral 
arms) or Magellanic type irregulars (whose disks are 
less pronounced but so are their bulges) both have large 
Y values, in contrast to early type galaxies. We found 
that the Y parameter of a galaxy is strongly correlated 
with its visual morphology. 

We investigated the performance of the three 
different photometric parameters (C, Ce, and Y) 
for morphological classification into two (E-I-SO and 
Sa-hSb-FSc-FSdm-hIm) or three (E-hSO, Sa-fSb, and 
Sc-|-Sdm-|-Im) galaxy types. In both cases we found 
that the elliptical aperture classifier, Ce, is better than 
the standard circular aperture classifier, C. The coarse- 
ness parameter, Y", produces results superior to those 
obtained with either of the concentration index pa- 
rameters. Depending on the desired balance between 
completeness and contamination, sample completeness 
as high as 88% or contamination as low as 12% is 
achievable. 

We can further improve the classification by consid- 
ering 2-dimensional (using the Ce-Y plane) classifica- 
tion. In the case of classification into two morphological 



12 



types, this allows us to attain a 89% completeness and 
contamination as low as 12%. For classification into 
three morphological types, the completeness is 68%. 

Our newly devised photometric parameter, coarse- 
ness, provides a mode of morphological classification as 
good as the traditionally used human-eye classification. 
At the same time, it is fully automated, and thus, can 
be used quite easily for millions of galaxies. Therefore, 
the coarseness parameter presented in this paper has 
a potential to open new doors to detailed studies of 
galaxy morphology in current and future large CCD 
surveys. 
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